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@> Low-skew signal routing in a programmable array. 

(57) Providing low-skew dock signals to a Field 
Programmable Gate Array (FPGA) chip normally 
requires devoting a certain number of 
bondpads to that purpose. However, that limits 
the number of clocks that may be applied, and 
may also limit which bondpads can be used for 
that purpose. In the present invention, any in- 
put/output bondpad (eg., 238 - 241 ; 246 - 249) 
may be used to supply a low-skew clock, or 
other global type signal, to one or more of the 
Programmable Function Units (PFUs) (eg., 205 - 
213). This is accomplished by using a criss- 
crossed grid of parallel conductor groups (eg., 
254, 255). Any of the conductors may be sup- 
plied by a dock from a bondpad or alternatively 
driven directly from a PFU, thereby allowing the 
distribution of internally-generated docks. To 
facilitate programmable interconnects between 
the horizontal and vertical conductors, the out- 
er conductor in a group crosses over the others 
at defined intervals, to thereby become the 
inner conductor. In this manner, each celt (eg., 
201 - 204) may drive a subset of the conductors 
in a group, thereby redudng the number of 
drivers needed. This allows all the buffer and 
conductor topologies to be identical for each 
cell, which significantly aids in the com- 
puter-aided design of the FPGA, as well as 
simplifying its programming. 



FIG. 2 
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Background of th Invent! n 

Fi Id of the Invention 

The present inv ntion relates to programmabl 
arrays that implement logic functions on an integrated 
circuit. 

Description of the Prior Art 

In Field Programmable Gate Arrays (FPGAs), 
routing conductors are used to connect together logic 
elements, typically through programmable intercon- 
nects. A logic element may also be programmable, 
and is referred to herein as a "programmable function 
unit" (PFU), which is also known In the art as a "con- 
figurable logic block" (CLB) or "configurable logic a 
element" (CLE); see for example, U. S. patent 
4,870,302. For sequential circuits, some of these pro- 
grammable function units are storage elements (i.e. 
registers). These storage elements are generally en- 
abled to store new data through the use of a clock sig- 
nal, with each register possibly receiving a different 
clock. In most cases, FPGAs are made at least par- 
tially from internal blocks that are more or less iden- 
tical. The contents of a block includes both the PFUs 
and the dock routing noted above. This is desirable 
to facilitate the computer-aided design of the internal 
blocks themselves, and to ensure proper operation of 
interconnected blocks after the FPGA has been pro- 
grammed. 

In order to run a sequential circuit at high speed, 
the skew between a dock arriving at one PFU and the 
clock arriving at another PFU must be very small. On 
FPGAs, however, the delay introduced by routing a 
clock through a programmable interconnect causes a 
large skew between the time that the dock arrives at 
one PFU versus the time that same dock arrives at 
other PFUs. In a typical FPGA architecture as shown 
in Fig. 1 , this problem is circumvented by adding dedi- 
cated dock routing conductors to the FPGA. For ex- 
ample, the bond pads 101 and 102 are dedicated to 
receiving dock signals, and supplying these signals 
to the dedicated dock conductors 118 and 122, re- 
spectively. The conductors 118 and 122 in turn supply 
cdumn dock conductors 119, 120, 121; and 123, 124, 
125, as indicated, which drive each of the PFUs. In 
addition, the bondpads 103 and 104 similarly provide 
clocks to each PFU via other dedicated dock conduc- 
tors, which for simplicity of illustration are not shown. 
These dedicated dock routing conductors provide 
very low skew docks. 

How ver, on probl m with this implementation 
is that larg sections of these dock routing conduc- 
tors tend to be wasted if only a small number of the 
PFUs n th FPGA are connected to a particular 
clock signal. Thes dock routing conductors also tend 
to b difficult to use for routing other signals if they 



are not needed for dock distribution. A third drawback 
is that the access to drive thes clock routing conduc- 
tors from off of the chip tends to be optimized for only 
a small number of input/output (I/O) bondpads. On 
5 known method of economizing on dock distribution 
conductors is to multiplex the dock distribution lines 
from the bondpads. For example, the multiplexers 
128, 129 and 130 each receive a clock input from 
bondpads 107 and 108, via conductors 127 and 126, 
10 respectively. The multiplexers then select which dock 
to supply to each of the columns via the conductors 
130, 131 and 132. In a typical case, the bondpads 105 
and 106 also receive clock signals and supply them 
to inputs of the multiplexers via distribution lines, 
15 which for simplicity of illustration are notshown. In ad- 
dition, the inputs to the multiplexers can be supplied 
with a dock signal from other bondpads or from the 
outputs of one of the PFUs. However, these techni- 
ques do not address the limitations noted above. 
20 Therefore, it would be desirable to have an FPGA 
routing architecture that would allow for the distribu- 
tion of low skew clocks, as well as other types of glo- 
bal signals, using the same routing resources. It 
would also be desirable that all, or at least a signifi- 
es cant percentage, of I/O bondpads be approximately 
equivalent for the purpose of driving these global 
routing resources. The third need is that the global 
routing be flexible enough for each global signal to be 
able to drive anywhere from one PFU to all of the 
30 PFUs on the FPGA without wasting routing resourc- 
es. 

Summary of the Invention 

35 We have invented a technique of providing low- 
skew distribution of global type signals in an integrat- 
ed circuit programmable logic array. The present 
technique provides for a criss-crossed grid of conduc- 
tor groups. Any of the conductors may be supplied by 

40 a dock from a bondpad. At least one programmable 
cross-connect provides for connecting a conductor in 
one group to a conductor in another group. In one em- 
bodiment, the conductors are continuous across the 
logic array, so that they may be driven from either of 

45 two bondpads on opposite sides of the integrated tir- 
cuit chip. In a presently preferred embodiment, to fa- 
cilitate programmable connections between the criss- 
crossed conductors, the outer conductor in a group 
crosses over the others at defined intervals, to there- 

50 by become the inner conductor. In this manner, each 
group of conductors may more economically drive 
conductors in the other group. 

Brief Description f the Drawings 

55 

Fig. 1 shows a typical prior-art clock routing 
scheme. 

Fig. 2 shows an embodiment of the inv ntive 
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technique. 

Fig. 3 shows a typical circuit embodiment of pro- 
grammabl cross-connects used with the inventive 
technique. 

Detailed Description 

The following detailed description relates to a 
technique wherein the problems stated above are 
avoided by creating a criss-crossed grid of routing 
conductors that may be used to distribute global sig- 
nals with minimum skew in the FPGA. Referring to 
Fig. 2, a 3x3 array of programmable function units 
205 ... 213 provides an illustrative embodiment of the 
invention, with a larger size array being likely in a typ- 
ical commercial application. Each global routing con- 
ductor runs in a multi-conductor "group* located be- 
tweenThe rows and columns of t he P FUs in t he array. 
The groups are illustratively of four conductors each 
(e.g., 254 and 255), and lie along either one of two 
perpendicular axes, referred to illustratively herein as 
the "vertical" and "horizontal" axes. The vertical con- 
ductors are typically Implemented in a first conductor 
level, and the horizontal conductors are typically im- 
plemented in another conductor level separated 
therefrom by a dielectric level. The conductors are de- 
sirably metal in most cases. As illustrated, the hori- 
zontal conductors are shorter than the vertical con- 
ductors; however, in practice it is typically desirable 
that the lengths be equal. 

Two groups of vertical global routing conductors 
and two groups of horizontal routing conductors are 
illustrated. These conductors, along with the associ- 
ated PFUs 206, 207, 209 and 210, form "programma- 
ble logic cells" (PLCs) 201 , 202, 203 and 204, respeo- 
tively. Other global routing conductors may be simi- 
larly provided for PFUs 205, 208 and 211, which for 
clarity of illustration are not shown. Furthermore, the 
FPGA typically also includes other (non-global) rout- 
ing conductors according to techniques known in the 
art, also not shown. The various programmable con- 
nections between the global routing conductors and 
non-global (i.e. "local") conductors are also not 
shown. However, either type of conductor can typical- 
ly drive the other by means of the appropriate pro- 
gramming. The PFUs usually send and receive most 
signals via the local conductors, with the global con- 
ductors typically being reserved for clock signals. It is 
possible for a global conductor to send a global signal 
to all the PFUs in the array. However, a signal may 
stfll be considered "global" even if it does not go to all 
of the PFUs. A signal is considered global as used 
herein if it goes to at least 20 percent of the PFUs, and 
more typically at least 50 perc nt of th PFUs in the 
array. 

In th pres nt technique, a given global routing 
conductor is connected t a programmable input buf- 
fer on at least one end. For example, the programma- 



ble input buffers 242 ... 245 each drive th horizontal 
conductors 234 ... 237, respectively. Therefore, a 
clock signal applied to the bondpads 238 ... 241 may 
be driven onto thes conductors. Similarly, the clock 

5 signal may b applied to the bondpads 246 ... 249 on 
the opposite dge of the integrated circuit chip 200, 
so that the conductors may be driven through the pro- 
grammable input buffers 250 ... 253. As shown, th 
conductors 234 ... 237 are continuous across the chip 

10 from one set of input buffers to the other. However, 
in an alternate embodiment of the invention, the con- 
ductors extend across only a portion of the chip. For 
example, they may be discontinuous between cells 
201 and 202. In that case, they are each driven by 

is only one of the input buffers shown. However, they 
are typically continuous across at least two of the 
cells of the FPGA. It can be seen that the vertical 
routing conductors (e^., 214 ... 217) are similarly 
connected to programmable input buffers (222 ... 225 

20 and 230 ... 233), which connect to bondpads (218 ... 
221 and 226 ... 229). The vertical conductors may 
also extend continuously across the chip between th 
buffers as shown, allowing them to be driven from 
either end of the chip. Alternatively, the vertical con- 

25 ductors may be discontinuous, as between ceils 201 
and 203, for example. This arrangement gives the de- 
signer great flexibility in providing clock signals to the 
conductors, as compared to prior-art techniques. 
In the present technique, programmable cross- 

30 connects are provided so that a global routing con- 
ductor in one axis may be used to provide a global sig- 
nal to a routing conductor in the perpendicular axis. 
For example, a routing conductor in the horizontal 
axis may be used to drive one or more routing conduc- 

35 tors in the vertical axis. The one or more vertical rout- 
ing conductor may then be used to distribute the dock 
signal to a column of PFUs. Any one of the horizontal 
routing conductors in a given group may provide the 
clock signal from the bondpad. To accomplish this 

40 connection, the programmable connector 256, 257, 
258 or 259 that is associated with the horizontal con- 
ductor supplying the clock signal is programmed to 
connect to the input of cross-connect driver 260. Sim- 
ilarly, a given vertical routing conductor may be used 

45 to drive one or more of the horizontal routing conduc- 
tors, which may distribute the clock signal to a row of 
PFUs. Alternatively, a PFU output may be driven onto 
one or more of the horizontal and/or vertical conduc- 
tors, thus allowing internally-generated clocks (or 

so other global signals) to be distributed in the same 
manner. 

Since programmable connections are relativ ty 
costly for FPGAs, especially since each connection to 
global conductors typically r quires a buffer, it is cost- 
55 ly to supply programmable connections to/from all 
vertical routing conductors from/to all horizontal rout- 
ing conductors. Therefore, it is advantageous in many 
cases to provide these connections for only a subset 
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of all possible combinations between the horizontal 
and vertical routing conductors. In the embodiment of 
Fig. 2, only the conductor nearest the PFU in a given 
cell is driven from a conductor in th other axis. For 
example, in cell 201 , only horizontal line 237 is driven 5 
from the vertical group (through cross-connect driver 
265), whereas only the vertical line 21 5 is driven from 
the horizontal group (through cross-connect driver 
260). However, it is still possible to supply the dock 
signal to any of the PFUs while avoiding undesirable 1 o 
skew, and still allowing for identical cells. This is ac- 
complished by twisting each of the group of routing 
conductors in each axis inside of each cell. This al- 
lows those routing conductors that cannot be con- 
nected to the output of a buffer inside a given cell to 15 
eventually be rotated to where this routing conductor 
can be driven inside another internal cell. 

For example, as shown in Fig. 2, the conductor 
214 is on the left at the bottom of the array as viewed. 
However, the conductor 217 crosses over the other 20 
three conductors in the cell immediately below cell 
203. Therefore, 217 becomes the outer conductor 
(i.e., furthest from PFU 209) in cell 203, and 216 be- 
comes the inner conductor. The conductor 216 then 
crosses over the other three conductors within cell 25 
203, so that it becomes the outer conductor, and 215 
becomes the inner conductor in cell 201. A similar 
twist is provided for the horizontal conductors. The 
twist is typically implemented by forming conductor 
vias that allow the conductor to cross over the other 30 
conductors on a different conductor level, being sepa- 
rated therefrom by a dielectric. Thus, although only a 
subset of the clock routing conductors may be cross- 
connected in a given cell, all routing conductors can 
be driven by one or more cells inside the entire FPGA. 35 
(This assumes that the number of rows and/or col- 
umns in the array, with one twist per cell, is at least 
equal to the number of the conductors (N) in each 
group.) However, the present invention may be prac- 
ticed without twists by simply increasing the number 40 
of cross-connects that connect the horizontal and 
vertical groups. Then, more than one conductor in a 
given cell may be driven from the conductors in the 
other axis. 

One embodiment of the programmable cross- 45 
connect used in the above technique is illustrated in 
Fig. 3. The horizontal conductors 301 ... 304 are con- 
nected to the inputs of buffers 305 ... 308, which drive 
pass transistors 309 ... 312, respectively. The pass 
transistors form transmission gates that are control- 50 
led by programmable means 313 ... 316, which place 
either a high or low logic level on the gates of the pass 
transistors, thereby allowing a signal to pass or b 
bl eked, respectively. Th programmable means 
313 ... 316 are typically static registers that are soft- 55 
ware-controlled by signals supplied to the FPGA chip 
by an external source, according to techniques known 
inth arLTh programmable means may alternative- 



ly comprises a ROM (read only memory), EEPROM 
(electrically erasable, programmabl ROM), static 
random access memory, or other storage device. Th 
outputs of the pass transistors are conn cted to th 
input of driver 340, which provides a high driv capa- 
bility so as to driv the vertical conductor 323 at high 
speed with minimum delay, to minimize clock skew. 
Driver 340 is connected to conductor 323 by pass 
transistor 341, which is controlled by programmable 
means 342. In a similar manner, the vertical conduc- 
tors 320 ... 323 may provide a clock signal onto th 
horizontal conductor 301 by means of buffers 324 ... 
327, pass transistors 328 ... 331, programmabl 
means 332 ... 335, driver 317, pass transistor 318, 
and programmable means 319. 

Note that in Fig. 3 the conductors in a given axis 
are continuous through the programmable conn c- 
tors; only the cross-connects from one axis to another 
include buffers and transmission gates. Hence, the 
clock skew introduced by the programmable cross- 
connects is minimized. Also included in Fig. 3 are 
pass transistor 338 and programmable means 339. 
These allow the signal from any of the horizontal con- 
ductors 301 ... 304 that are selected by the program- 
mable means 313 ... 316 to be supplied directly to the 
clock (CK) input of the PFU in the same cell as the 
programmable cross-connect circuitry. Similarly, the 
pass transistor 336 and programmable means 337 al- 
low the signal from any of the vertical conductors 
320 ... 323 to be supplied to the CK input; see also 
Fig. 2 in this regard. 

The present technique allows for distributing glo- 
bal signals from a bondpad to the PFUs with various 
degrees of skew, depending on the needs of the cir- 
cuit To minimize the skew, a single global conductor 
in a given (e.g., vertical) axis that carries the global 
signal drives multiple global conductors located in the 
other (e.g., horizontal) axis. Then, the individual 
PFUs are driven from the global conductors in the 
horizontal axis. Furthermore, the lowest skew is typ- 
ically obtained if the single global conductor in the 
vertical axis is centrally located. This vertical global 
conductor may be driven directly from a bondpad. 
However, even lower skew is possible if the vertical 
conductor is in turn driven at its mid-point through a 
programmable cross-connect from a horizontal con- 
ductor that is driven from the bondpad. That results 
in the most symmetrical signal distribution, and hence 
lowest skew, but with a slight increase in signal laten- 
cy. Alternatively, the skew is in practice still relatively 
low even when the single vertical conductor is located 
along an edge of the array, and driven directly from a 
b ndpad. It is alternate ly possibl to us only a sin- 
gl global conductor, which distributes the global sig- 
nal to local conductors, which in turn d liver the signal 
to th individual PFUs. How ver, the local conductors 
introduce additional d lay, sine they are not contin- 
uous conductors, but include pass transistors used as 
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transmission gates. Hence, the skew is typically 
somewhat higher than in the previously-described 
cases. 

It can be seen that with the present invention, the 
connections between the horizontal and vertical rout- 
ing conductors in each axis are configurable, whereby 
only the needed routing conductors will be occupied 
with a given signal. This allows the unused dock rout- 
ing conductors to remain free to route other signals. 
Also, the buffer that is provided in the cross-connect 
between the vertical and horizontal routing conduc- 
tors supplies a fast, non- degraded clock signal to the 
PFUs. In the above embodiment, any of the global 
routing conductors in one axis can be used to drive 
the global routing conductors in the other axis with 
approximately the same effectiveness. Therefore, 
any global routing conductor can be used as a clock 
source. If every user I/O pad on the device has the 
same connectivity to at least one of these global rout- 
ing conductors, then every I/O pad can be used as a 
clock pin. Thus the need for dedicated clock pins for 
an FPGA is alleviated. Although clock signals are typ- 
ical of the types of signals that can be routed with the 
present invention, other types of global signals may 
also benefit from the use of the present technique. 
These signals include, for example, register resets, 
clock enables, tri-state enables, and memory write 
signals, with still others being possible. 

In a presently-preferred embodiment, all of the 
signal (non-power supply) bondpads of the FPGA 
chip are connected to programmable input buffers 
(e.g., 222 ... 225, etc.). This allows for great flexibility 
in supplying the global signals to the FPGA. In addi- 
tion, these bondpads typically also supply other 
types of signals that may be selected when the glo- 
bal-type input signals are de-selected. For example, 
referring to Fig. 2, the input buffer 270 may be pro- 
grammed to drive the global line 271. Alternatively, 
the input buffer 270 may be programmed to drive the 
local line 272. In that case, the global line may be de- 
selected, or alternatively may remain connected if de- 
sired. It is also possible to drive the global line from a 
PFU when the input buffer is not programmed to drive 
the global line from a bond pad. 

Alternatively, it is possible to implement the in- 
vention without connecting all of the signal bondpads 
to the global input buffers, whereby some of the bond- 
pads do not supply global signals. For example, bond- 
pads 266 ... 269 may be dedicated to non-global 
uses. However, to maximize the benefits of the inven- 
tive technique, we recommend that at least half of the 
signal bondpads on the FPGA chip be connected to 
the programmable global input buffers. Note that at 
least some of th bondpads also supply output sig- 
nals from the integrated circuit, which function may b 
combined with supplying input signals using I/O buf- 
fers known in the art Still other variations of the pres- 
ent technique are possible. 



Claims 

1. An integrated circuit comprising an array of pro- 
grammable logic cells (eg., 201 - 204) and bond- 

5 pads for supplying input signals; wherein each 

cell (eg., 201) comprises routing conductors and 
a programmable logic element (eg., 206); 

Characterized in that said integrated cir- 
cuit comprises global routing conductors, with a 

10 first group of global routing conductors (eg., 234 
- 237) lying parallel to a first axis of the array, and 
a second group of global routing conductors (eg., 
214 - 217) lying parallel to a second axis; 

wherein each of said global routing con- 

15 ductors is continuous across a portion of said ar- 
ray that spans at least two logic cells; and con- 
nected to a bondpad (eg., 238 - 241) through a 
programmable"iriput buffer (eg., 242 - 245); 

and wherein a programmable cross- 

20 connect (eg., 257 - 265) provides for programma- 
bly connecting at least one conductor of said first 
group and a conductor of said second group. 

2. The integrated circuit of claim 1 wherein at least 
25 some of said global routing conductors are each 

connected at both ends to bondpads through pro- 
grammable input buffers located on opposite 
sides of said integrated circuit 

30 3. The integrated circuit of claim 1 wherein all of said 
global routing conductors are each connected at 
both ends to bondpads through programmable 
input buffers located on opposite sides of said in- 
tegrated circuit 

35 

4. The integrated circuit of claim 1 wherein at least 
some of said global routing conductors are each 
connected at only one end to a bondpad through 
a programmable input buffer. 

40 

5. The integrated circuit of claim 1 wherein all of t he 
signal bondpads on said integrated circuit are 
connected to programmable input buffers that 
programmably connect to said global routing con- 

45 ductors. 

6. The integrated circuit of claim 1 wherein at least 
half of the signal bondpads on said integrated cir- 
cuit are connected to programmable input buffers 

50 that programmably connect to said global routing 
conductors. 

7. The integrated circuit of claim 1 further compris- 
ing programmabi connection means for pro- 

55 grammably connecting at least one of said global 
routing c nductors to said programmable logic 
element 



30 



35 



40 



55 



9 EP 0 596 658 A1 10 

8. The integrated circuit of claim 7 wh rein said at 
least one of said global routing conductors sup- 
plies a global signal to said programmable logic 
element 

5 

9. The integrated circuit of claim 8 wherein said glo- 
bal signal is a clock signal. 

1 0. The integrated circuit of claim 7 wherein said pro- 
grammable logic element supplies an internally- 10 
generated global signal to said at least one of said 
global routing conductors. 

11. The integrated circuit of claim 10 wherein said in- 
ternally-generated global signal is a clock signal. 15 
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